Value function approximation via low-rank models

نویسنده

  • Hao Yi Ong
چکیده

We propose a novel value function approximation technique for Markov decision processes. We consider the problem of compactly representing the state-action value function using a low-rank and sparse matrix model. The problem is to decompose a matrix that encodes the true value function into low-rank and sparse components, and we achieve this using Robust Principal Component Analysis (PCA). Under minimal assumptions, this Robust PCA problem can be solved exactly via the Principal Component Pursuit convex optimization problem. We experiment the procedure on several examples and demonstrate that our method yields approximations essentially identical to the true function.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Matrix Approximation under Local Low-Rank Assumption

Matrix approximation is a common tool in machine learning for building accurate prediction models for recommendation systems, text mining, and computer vision. A prevalent assumption in constructing matrix approximations is that the partially observed matrix is of low-rank. We propose a new matrix approximation model where we assume instead that the matrix is only locally of low-rank, leading t...

متن کامل

Survey on Probabilistic Models of Low-Rank Matrix Factorizations

Low-rank matrix factorizations such as Principal Component Analysis (PCA), Singular Value Decomposition (SVD) and Non-negative Matrix Factorization (NMF) are a large class of methods for pursuing the low-rank approximation of a given data matrix. The conventional factorization models are based on the assumption that the data matrices are contaminated stochastically by some type of noise. Thus t...

متن کامل

To “ Optimization via Low - Rank Approximation , with Applications to Community Detection in Networks ”

5.1. Proof of results in Section 3.1. Under degree-corrected block models, let us denote by Ā the conditional expectation of A given the degree parameters θ = (θ1, ..., θn) T . Note that if θi ≡ 1 then Ā = EA. Since Ā depends on θ, its eigenvalues and eigenvectors may not have a closed form. Nevertheless, we can approximate them using ρi and ūi from Lemma 3. To do so, we need the following lemma.

متن کامل

Regression-aware decompositions

Linear least-squares regression with a “design” matrix A approximates a given matrix B via minimization of the spectralor Frobenius-norm discrepancy ‖AX − B‖ over every conformingly sized matrix X. Another popular approximation is low-rank approximation via principal component analysis (PCA) — which is essentially singular value decomposition (SVD) — or interpolative decomposition (ID). Classic...

متن کامل

A Direct Solver for the Advection-diffusion Equation Using Green’s Functions and Low-rank Approximation

A new direct solution method for the advection-diffusion equation is presented. By employing a semi-implicit time discretisation, the equation is rewritten as a heat equation with source terms. The solution is obtained by discretely approximating the integral convolution of the associated Green’s function with advective source terms. The heat equation has an exponentially decaying Green’s funct...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1509.00061  شماره 

صفحات  -

تاریخ انتشار 2015